Method and apparatus for bit operational process

ABSTRACT

A bit operation processor having a first address operation unit for updating the address of data in units of byte or multipled bytes for performing operation in units of byte or multiple of bytes, a second address operation unit for updating the address of data in units of bit or multiple of bits, an address control means operating on the first address operation unit to advance the address in response to the result of address advancement by the second address operation unit, and means for fetching byte-wide data for operation as addressed by the first address operation unit, whereby operation between data of any number of bits at any positions in byte blocks is controlled simply and fast.

This is a continuation of application Ser. No. 08/123,087, filed Sep. 17, 1993 now abandoned; which is a Continuation of application Ser. No. 07/836,738, filed Feb. 19, 1992, now U.S. Pat. No. 5,265,204; which is a Continuation of application Ser. No. 07/641,064, filed Jan. 14, 1991, which issued as U.S. Pat. No. 5,175,816 on Dec. 29, 1992; which is a Continuation of parent application Ser. No. 07/779,794, filed Sep. 24, 1985 which issued as U.S. Pat. No. 5,034,900 on Jul. 23, 1991.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to a digital processing system and, particularly, to a method and apparatus for bit operational process suitably used in an image processing system having a bit-map display.

2. Background

The conventional system will first be described by using an example of image processing shown in FIG. 1. In the figure, reference symbol M1 denotes a memory area storing image data in 1-to-1 correspondence to a CRT (Cathode Ray Tube) screen, M2 denotes a memory area storing image data to be added to the image data in M1, XA and XB denote partial areas in M1 and M2, respectively, for which image data processing takes place, W_(A0), W_(A1), W_(A2), W_(B0) and W_(B1) denote boundaries of data words having a word length of 16 bits, for example, R₀ through R_(m) represent raster lines for the partial areas X_(A) and X_(B), na and nb represent displacements of the leading edges of the areas X_(A) and X_(B) from the word boundaries W_(A0) and W_(B0), respectively, A₀ through A_(n) and B₀ through B_(n) represent addresses of word data in the areas X_(A) and X_(B) , and MFY denotes a modification unit implementing the alignment and processing for the areas X_(A) and X_(B) having different starting bit positions na and nb.

Since the currently available processing unit such as a microprocessor deals with data and makes access to the memory in units of a word or a byte, the memory areas M1 and M2 shown in FIG. 1 have a word or byte structure. However, in image processing, a partial screen area to be processed is specified the outside of the system without regard to the word boundary as shown by areas X_(A) and X_(B) in FIG. 1. On this account, image processing for combining the partial areas X_(A) and W_(B) needs a modification unit MFY with the following three processing functions.

(1) Rearrangement of word data so that processing can take place on a word-wide basis between data for areas X_(A) and X_(B) with different starting bit positions na and nb.

(2) Separation of data section from word-wide data e.g., na bits, in each of addresses A₀, A₃, . . . , A_(n−2) so that it is retained unchanged in the processing.

(3) Data processing in any specific number of bits (bit width) so that monochrome display is implemented using one bit per pixel while color display uses a plurality of bits per pixel (generally four bits per pixel).

The operation of the modification unit having these functions will be described in connection with FIG. 2. Throughout the following description, it is assumed that the image data memory is addressed in units of a word.

FIG. 2 shows a 2-word register SRC(A) and SRC(B) for storing data read out of the processing area X_(B), a 2-word register DST(A) and DST(B) for storing data read out of the processing area X_(A), and a 2-word register DST(A) and DST(B) for storing the result of processing for the contents of the registers SRC(A, B) and DST(A, B). The modification unit MFY performs rotation of the register SRC(A, B), i.e., shift of SRC content with bit 0 of SRC (A) linked with bit F of SRC(B), depending on the values of SN (i.e., nb) and DN (i.e., na) representing the starting bit positions of the processing areas X_(A) and X_(B), as follows.

(a) For SN>DN: Rotate the SRC content left by a number of bits of SN-DN.

(b) For SN<DN: Rotate the SRC content right by a number of bits of DN-SN.

(c) For SN=DN: No operation.

In this way, bit addresses nb(SN) and na(DN) are used to align the operation starting bit position.

Consequently, the starting bit position of the SRC content is adjusted to that of the DST content. The bit width of processing, WN, is set in advance, and the remaining portion of data is left unchanged. Although in FIG. 2 the result register MRG(A, B) is provided independently of DST(A, B), they may be arranged in common. After the subsequent processing, the original bit position of the SRC content is restored automatically.

Next, the 4-bit image processing for the areas X_(A) and X_(B) by the modification unit MFY will be described in connection with FIGS. 3, 4, 5 and 6. The process shown in FIG. 3 includes step S1 of setting the starting address A₀ for the processing area X_(A), step S2 of setting DN to the starting bit position (address) na, step S3 of setting the starting address B₀ for the processing area X_(B), step S4 of setting SN to the starting bit position (address) nb, step S5 of the process implemented by the modification unit MFY mentioned above, steps S6-S9 for the area X_(B) for obtaining the next bit address (S6), setting the next SN (S7), incrementing the address in word units (S8) and reading next word data (S9), and steps S10-S14 for the area X_(B) for obtaining the next bit address (S10), setting the next DN (S11), writing the result of process in the register MRG(A) (S12), incrementing the address in word units (S13) and reading the next word data (S14). The process further includes decision steps SB1 and SB2, which implement the following operations.

(I) Decision step SB1

This step tests as to whether the next SN address of SRC resulting from the steps S6 and S7 as in the following expression (1) reaches beyond the word boundary as in the following expression (2), and controls the sequence to fetch the next word data when the condition (2) is met.

SN=SN+WN  (1)

SN≧(10)_(HEX)  (2)

(II) Decision step SB2

This step tests as to whether the next DN address of DST resulting from the steps S10 and S11 reaches beyond the word boundary as in the following expression (3), and controls the sequence to write data in the register MRG(A) to the area X_(A) when the condition (3) is met, which indicates the end of operation at the current word boundary.

DN≧(10)_(HEX)  (3)

The above operations for one raster (R0) will be described in more detail in connection with FIGS. 4, 5, and 6.

FIG. 4 is the case of condition, DN + WN = (A)_(HEX) + (4)_(HEX) < (10)_(HEX)

Then, reading of the next word data and writing of process result do not take place.

FIG. 5 is the case of condition, DN + WN = (E)_(HEX) + (4)_(HEX) > (10)_(HEX)

Then, reading of the next word data and writing of process result take place.

FIG. 6 is the case of condition, SN + WN = (D)_(HEX) + (4)_(HEX) > (10)_(HEX)

and  DN + WN = (2)_(HEX) + (4)_(HEX) < (10)_(HEX)

Then, reading of the next SRC word data takes place, but writing of the process result does not take place.

The foregoing prior art processing system involves the following drawbacks.

(1) The conventional microprocessor of word addressing type needs register rotation and word boundary check by software in implementing bit block operations, resulting in a complex system control.

(2) Fetching of data from the processing areas X_(A) and X_(B) needs different access timing depending on the current bit position with respect to the word boundary, resulting in a complex software control.

(3) The amount of data stored in the memory areas M1 and M2 will range as much as from 100 kilo-bytes to several mega-bytes, and the process shown in FIG. 3 with the bit width WN being set as large as one byte (8 bits) will take a total number of steps of the order of 10⁶, and therefore the number of processing steps needs to be reduced drastically.

Furthermore, the conventional microprocessor merely allows bit operations such as arithmetic shift, logical shift, bit set, bit reset, etc., but as to other arithmetic and logic operations, etc., it is impossible to carry out the operations except only in a fixed bit length such as a byte or word. On this account, in order to achieve “raster operation” on a bit-map display having a memory in correspondence at each point of on/off control to the display screen for implementing an image process between separate rectangular areas of arbitrary size on the screen, the above-mentioned bit operations do not suffice the purpose, but operations of data with any bit width at any position in each word becomes necessary. If such operations are intended to be performed using a microprocessor, input data is shifted for bit alignment, an operation is conducted on the data, the resultant data is shifted for alignment with another data to be merged, and after the merging operation the resultant data is stored in the original memory location. These sequential operations take too long a time, and fast image processing cannot be expected.

There is a method of solving this problem, in which there is added to the system a barrel shifter that is capable of multi-bit shift at the same operating speed as the single-bit shift, and a merging circuit. However, despite the capability of bit alignment by the barrel shifter, the processor is limited to fixed word-length operations and external memory access usually in 8-bit or 16-bit length, and the restricted hardware ability for implementing arbitrary bit width operations need to be covered by complex software processes through the use of simple bit operations. An example of the processor for implementing the foregoing operations is Micro Processor, model Am 29116, manufactured by ADVANCED MICRO DEVICES.

SUMMARY OF THE INVENTION

It is an object of this invention to provide a method and apparatus for bit operation with the intention to simplify and speed up the computation between data with arbitrary number of bits at arbitrary position in each data word.

Another object of this invention is to provide a bit processing system capable of easily accessing an external data memory under word address control for the internal processing under bit address control.

Still another object of this invention is to provide a bit operation unit capable of easily executing an arithmetic and logic operation for bits with any number of bits at any position in each data word.

A further object of this invention is to provide a bit operation processing system capable of bit and word address control and external memory access control on a hardware basis so that the overhead software processing is reduced to enhance the system performance.

In order to achieve the above objectives, this invention has features as follows.

(1) Internal computations are controlled entirely on the basis of bit addressing.

(2) A bit address operation unit is provided for bit addressing control in addition to the word address operation unit for word addressing control.

(3) The bit address operation unit operates to add the current operation starting bit address to the bit width of bits to be operated.

(4) The bit address and word address operation units have an interface through the carry signal produced by the bit address operation unit.

(5) The carry signal of the bit address operation unit, when seen from the internal processing control, is an anticipation signal indicating that the current starting bit position will reach beyond the word boundary in the next operation cycle. Accordingly, the carry signal is used to trigger the external memory access for fetching word data necessary for the bit operation at the word boundary.

(6) The bit address and word address operation units, share the hardware components, but have the distinct logical functions relating through the carry signal.

(7) The bit address operation unit operates cyclically in a word period, and its output represents the relative bit address counted from each word boundary.

(8) The carry signal produced at a certain bit position of the bit address operation unit creates the boundary of words of 2^(n) bits.

(9) The bit address operation unit performs addition of a operation bit width for each register independently, allowing bit operations of arbitrary number of bits.

On the other hand, in order for the bit-map display to achieve operation between data of rectangular screen areas, i.e., raster operation, it is necessary to achieve operation between data with any bit width at any position in the data word. For the computation of data with any bit width by a fixed bit-width processor, the absent bit oppositions of input data need to be filled. In simple arithmetic operations, a fixed bit width processor can deal with data with reduced number of bits by filling 0's bits in the lower absent bit positions to obtain a correct result including the carry bit. In the carry adding operation, lower absent bit positions must be filled with 1's bits to obtain a correct result. Logical operations between data are implemented for each corresponding bit separately, and the number of significant bits is arbitrary for the processor to obtain a correct result, except for the flags, which values are correct when absent bit positions are filled with 0's or 1's bits selectively. Accordingly, in carrying out an arithmetic or logic operation for data with arbitrary number of bits, input data are placed at high order bit positions of the processor, with absent bit positions being filled with 0's bits or 1's bits depending on the type of operation, thereby to obtain a completely correct result.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagram for explaining the image data processing intended by the present invention;

FIG. 2 is a diagram for explaining the operation of the modification unit (MFY) shown in FIG. 1;

FIG. 3 is a flowchart showing the operational procedure of the modification unit in the conventional system;

FIGS. 4, 5 and 6 are diagrams for explaining the operation shown by the flowchart of FIG. 3;

FIG. 7 is a block diagram showing the bit data processing system embodying the present invention;

FIG. 8 is a table for explaining the bit width of computation by the modification unit;

FIG. 9 is a table for explaining the relation between the carry signal produced by the bit address operation unit and the access timing;

FIG. 10 is a flowchart showing the image processing operation carried out by application of this invention;

FIG. 11 is a flowchart showing part of the process of FIG. 10 conducted by the memory interface unit (MIF);

FIGS. 12, 13 and 14 are diagrams for explaining the operations shown in FIGS. 10 and 11;

FIG. 15 is a block diagram showing in detail the arrangement of the bit operation unit shown in FIG. 7;

FIG. 16 is a table showing the output function of the operand data slicing circuit 3 shown in FIG. 15;

FIG. 17 is a table showing the output function of the operating data slicing circuit 4 shown in FIG. 15;

FIG. 18 is a table listing the types of operation performed by the processing unit 5 shown in FIG. 15;

FIG. 19 is a table showing the merged data R2 shown in FIG. 15;

FIG. 20 is a table showing the writing mask data M shown in FIG. 15;

FIG. 21 is a table showing the output function of the operation result register 7 shown in FIG. 15;

FIG. 22 is an illustration showing the execution of reaster operation on the bit-map display;

FIG. 23 is a diagram for explaining the operation of the bit processing system of the case with displaced bit positions; and

FIG. 24 is a block diagram showing the system configuration for implementing the image processing according to this invention.

DESCRIPTION OF THE PREFERRED EMBODIMENT

In FIG. 24 showing the image processing system, the bit operation processor 110 is controlled by CPU 100 of the host computer to perform image processing such as expansion, reduction, rotation and merging of image patterns, ({circle around (1)}). In implementing the image processing, the bit operation processor 110 fetches data from the image data memory 120, processes the data, {circle around (2)}, and stores the resultant data in the image data memory 120 so that it is displayed on the CRT display unit 130, ({circle around (3)}).

This invention is intended to provide means for implementing the above-mentioned operations ({circle around (2)}, {circle around (3)}) of reading and writing the image data memory 120 for data of any bit width at any positions in the data word.

First, bit position control necessary for bit operation will be described with reference to FIG. 7 showing an embodiment of the inventive bit operation processor.

The arrangement of FIG. 7 includes a word address operation unit ADW, a bit operation unit BOU for accessing data in the aforementioned memory areas M1 and M2 in the image data memory 120, the bit operation unit BOU performing the aforementioned three functions (1), (2) and (3) by rotating the register contents depending on the bit addresses SN and DN of the processing areas X_(A) and X_(B), a bit address operation unit ADB, and a bit address operation unit BR including an operation bit width register WNR for storing the value of operation bit width WN, a source bit address register SNR for storing the operation starting bit position SN for the processing area X_(B) and a destination bit register DNR for storing the operation starting bit position DN for the processing area X_(A). Signal line AC is for the carry produced by the bit address operation unit ADB, signal bus MA is for the word address produced by the word address operation unit ADW, and signal bus D is for word data, through which buses image data is transferred with the memory areas M1 and M2. The bit register unit BR and the bit address operation unit ADB in combination constitute a bit address control unit BM. The contents of the bit register unit BR, i.e., WN, SN and DN, are used by the bit operation unit BOU.

The bit operation unit 110 fetches data from separate memory areas M1 and M2 within the image data memory 120 via the memory interface unit MIF. There are two cases of reading the memory areas M1 and M2 depending on the starting bit position na (nb) of a data segment with a bit width WN to be processed currently in a data word with a bit width of L as follows.

(a) New word data is required for the subsequent operation, in case, ${\frac{{na} + {Wn}}{L} \geqq 1},\quad {\frac{{nb} + {Wn}}{L} \geqq 1}$

(b) Current word data suffices for the subsequent operation, in case, ${\frac{{na} + {Wn}}{L} < 1},\quad {\frac{{nb} + {Wn}}{L} < 1}$

The above conditions are tested on a hardware basis through the provision of an L-bit bit address operation unit ADB for adding values na (nb) and WN, with the decision being made depending on the presence or absence of the carry signal AC from the operation unit ADB. The carry signal AC indicates the need of next word data reading, and it is used to trigger the word address operation unit ADW for addressing the next data word. In this way, the bit operation unit 110 makes access to the memory areas M1 and M2 only when new word data becomes necessary for processing.

In describing operation unit 110, the operation of the bit address control unit BM will first be explained.

The bit address control unit BM operates on the bit address operation unit ADB to add a starting bit address SN (DN) in the register SNR (DNR) to a operation bit width WN in the register WNR to evaluate the starting bit address SN (DN) for the next operation, and stores the result in the register SNR (DNR).

Generally, image processing is conducted between image data in two separate screen areas, and therefore both corresponding memory areas X_(A) and X_(B) under process have distinct starting bit addresses which need to be stored separately, DN in DNR and SN in SNR. The operation bit width WN is constant during the entire process and common to both processing areas X_(A) and X_(B), and it is stored in the single register WNR.

The bit address operation unit ADB is of four bits as mentioned previously, providing a result in the range of (0)_(HEX to (F)) _(HEX). Accordingly, the output of the bit address operation unit ADB represents the bit position between contiguous word boundaries. However, the operation bit width WN used by the bit operation unit BOU requires bit range including (10)_(HEX) beyond the word boundary in addition to (1)_(HEX) to (F)_(HEX). On this account, the bit operation unit BOU is designed to interpret the operation bit width WN as shown by the table of FIG. 8. In this way, the bit address control unit BM calculates the relative bit address within a 16-bit data word.

The word address operation unit ADW operates to increment the word address in response to a signal from the bit address control unit BM. The following describes the interface between the ADW and the bit address operation unit ADB in BM for the word addressing operation. The word address operation unit ADW is notified by the ADB of the overrun of the word boundary by use of the carry signal AC produced by the ADB. However, the value which any of the bit address operation unit ADB and registers WNR, SNR and DNR, which have a capacity of four bits is (0)_(HEX) to (F)_(HEX), therefore, addition of WN and SN (or DN) does not always produce the carry signal AC to meet the purpose. For example, in case of WN=(F)_(HEX′), SN=(0)_(HEX) for the 16-bit operation (see FIG. 8), the following bit address calculation does not create the carry signal AC despite the case that the bit address width reach beyond the word boundary in the next operation.

WN+SN=(F)_(HEX)+(0)_(HEX)=(F)_(HEX)

On this account, the address increment operation by the bit address operation unit ADB must include addition of one so that the carry signal AC is produced as desired, as follows.

(WN+1)+SN=(F)_(HEX) +(1)_(HEX) +(0)_(HEX) =(10)_(HEX)  (4)

As mentioned above, a necessary carry signal AC can be produced by adding “1”, and addition of “1” becomes indispensable.

The aforementioned carry signal AC can be used as a decision signal indicating whether or not the bit position will reach beyond the current word boundary in the next operation cycle. Accordingly, the carry signal AC from the bit address operation unit ADB can be used as, (1) an anticipation signal indicating the need of fetching the next word data, and (2) a trigger signal to the word address operation unit ADW for generating the address of word data to be fetched. In other words, the carry signal AC from the bit address operation unit ADB can be used for timing the access operation of the memory interface unit MIF to the processing areas X_(A) and X_(B), as shown in FIG. 9. Due to separate registers SNR and DNR for storing the starting bit addresses SN and DN, the above-mentioned functions (1) and (2) of the carry signal AC can reflect on the processing areas X_(A) and X_(B) independently.

FIG. 10 shows in flowchart the operation of the foregoing embodiment of this invention applied to the image processing system shown in FIG. 1. In a sequence of operations, a processing step P1 sets the word address B₀ and bit address nb (SN=nb) of SNR for the operation starting bit position for the processing area X_(B), step P2 sets the word address A₀ and bit address na (DN=na) of DNR for the operation starting bit position for the processing area X_(A), step P3 is the function of the bit operation unit BOU, step P4 calculates the next operation starting bit position SN for the processing area X_(B) using the bit address operation unit ADB and word address operation unit ADW, step P5 similarly calculates the next operation starting bit position DN for the processing area X_(A), step XP1 reads a word data in the processing area X_(B), step XP2 writes the operation result in the processing area X_(A), step XP3 reads a word data in the processing area X_(A), step PB1 tests the completion of process for each of rasters Ro-Rm, and steps XB1 and XB2 test the results of executions in the above steps XP1, XP2 and XP3 in accordance with the presence or absence of the carry signal AC.

The decision steps XB1 and XB2 will be explained in more detail in the following.

(1) It is tested as to whether the data segment to be processed next ranges within the current data word or beyond the word boundary.

(2) At the decision step XB1, if the segment is within the current data word (case 1 in FIG. 9), the step XP1 is skipped, or if the segment reaches beyond the word boundary (case 2 in FIG. 9), the step XP1 is executed to read the next word data from the processing area X_(B).

(3) At the decision step XB2, if the segment is within the current word (case 3 in FIG. 9), the steps XP2 and XP3 are skipped, or if the segment reaches beyond the word boundary (case 4 in FIG. 9), the step XP3 is executed to read the next word data from the processing area X_(A).

(4) In case 4, the processing step XP2 for writing the processing area X_(A) is executed by the following reason. The processing area X_(A) is included in the memory area M1 as shown in FIG. 1, and it is also written the result of processing. When the next starting bit position calculated basing on the value of DN reaches beyond the word boundary, it indicates that the operation for one word data has completed.

Namely, the conventional system tests the word boundary condition for fetching the next word data on a software basis, whereas the inventive system employs a bit address operation unit ADB for anticipating the need of memory access, allowing the continuous execution of the internal bit operational process while dealing with external word data.

Decisions made by the steps XB1 and XB2 are based on the carry signal AC produced by the bit address operation unit ADB as described above, and the carry signal AC can readily be distinguished among the four cases shown in FIG. 9 depending on the use of register DNR or SNR. Accordingly, by implementing the decision process for the four cases as shown in FIG. 11 in the memory interface unit MIF, a processing step group X1 including the steps XB1 and XP1 , and a processing step group X2 including the steps XB2, XP2 and XP3, shown in FIG. 10, can be eliminated. In FIG. 11, steps P1-P5 and PB1 are identical to those shown in FIG. 10.

The foregoing operations of four cases are shown in FIGS. 12, 13 and 14, in which initial values are set as: the operation starting bit address SN=(5)_(HEX) and word address Bo for the processing area X_(B); the operation starting bit address DN−(A)_(HEX) and word address Ao for the processing area X_(A); the operation bit width WN=(3)_(HEX). FIG. 12 is for cases 1 and 3, FIG. 13 is for case 4, and FIG. 14 is for case 2 in FIG. 9.

Next, an embodiment of this invention with the intention of fast data processing between rectangular areas on the bit-map display, i.e., raster operation, will be described in connection with FIGS. 15 through 23.

In FIG. 15 showing in detail the bit operation unit BOU in the bit operation processor of FIG. 7, the arrangement includes an operand data register 1, an operating data register 2, an operand data slicing circuit 3, an operating data slicing circuit 4, a processing unit 5, a data merging circuit 6, an operation result register 7, a source bit address register DNR, a destination bit address register SNR, an operation bit width register WNR, a merging address register 11, an operation control register 12, and an operation command decoder 13. In this specification, term “operating data” is used to mean one member of an arithmetic/logic operation, such as X in Z=X+Y, while term “operand data” to mean another member of the operation, such as Y in Z=X+Y. The block diagram further indicates operand data I1, operating data I2, sliced operand data I3, sliced operating data I4, operand data slicing address IS1, operating data slicing address IS2, slicing bit width W, operation result R1, merging address D, writing mask data M, merged data R2, stored result data R3, operation command code FC, and operation decode data F. Fetching of data from the image data memory 120 to the registers 1 and 2, and storing of data from the register 7 in the memory are; conducted by making access to the image data memory 120 through the memory interface unit MIF as shown in FIG. 7.

For the simplicity of the following description on the operation of the above arrangement, the processing unit 5 is assumed to have 4 bits in relation to operand data I1, operating data I2, mask data M and merged data R2 each having 8 bits, twice the operation bit width, sliced operand data I3, sliced operating data I4 and computation result R1 each having 4 bits, identical to the operation bit width, and operand data slicing address IS1, operating data slicing address IS2, slicing bit width W and merging address D each having 2 bits, derived from the 2-bit processing unit 5.

FIG. 16 is the output function table for the operand data slicing circuit 3. In the table, IS1 ₀ and IS1 ₁ are the high-order bit and low-order bit of the operand data slicing address IS1, W₀ and W₁ are the high-order bit and low-order bit of the slicing bit width W, I3 ₀-I3 ₃ are 4-bit sliced operand data (I3 ₀ being highest bit, I3 ₃ lowest), I1 ₀-I1 ₇ are 8-bit operand data (I1 ₀ being highest bit, I1 ₇ lowest), and F is the operation decode data. The operand data slicing circuit 3 produces ‘F’ at I3 ₁-I3 ₃ when W equals to 0 (W₀=0, W₁=0), produces ‘F’ at I3 ₂ and I3 ₃ when W equals to 1 (W₀=0, W₁=1), produces ‘F’ at I3 ₃ when W equals to 2 (W₀=1, W₁=0), and produces an effective data at I3 ₀-I3 ₃ when W equals to 3 (W₀=1, W₁=1).

Namely, the slicing bit width W is actually added by one (W+1), so that the circuit performs slicing of data ranging from 1 bit to 4 bits. The operand slicing address IS1 specifies the highest order bit I1 ₀ through W+1th bit of operand data I1 when IS1 equals to 0 (IS1 ₀=0, IS1 ₁=0), specifies the second bit I1 through W+1th bit of operand data I1 when it is equal to 1 (IS1 ₀=0, IS1 ₁=1), specifies the third bit I1 ₂ and fourth bit I1 ₃ when IS1 equals to 2, and specifies the fourth bit I1 ₃ for slicing when IS1 equals to 3.

FIG. 17 is the output function table for the operating data slicing circuit 4, which operates identically to the operand data slicing circuit 3 with its input and output signals IS1, I1 and I3 being replaced with IS2, I2 and I4.

FIG. 18 a table of operation command codes FC, operation decode data F and types of operations. In the table, symbol A represents an operand data, B represents an operating data, “+” signifies logical sum, “·” signifies logical product, “−” signifies negation, “⊕” signifies exclusive logical sum, “plus” signifies arithmetic addition, “minus” signifies arithmetic subtraction, “carry” represents the value of carry flag, and “borrow” represents the value of borrow flag.

FIGS. 19 and 20 are the output function tables for the merging circuit 6, showing merged data R2 and writing mask data M, respectively. Each signal is suffixed to indicate bit positions in the same way as for the signals in FIGS. 16 and 17. The merged data R2 is not dependent on the slicing bit width W, but is a function of the merging address D and operation result R1. With D being equal to 0, the merged data R2 is given at bit positions R2 ₀-R2 ₃ a 4-bit operation result R1 ₀-R1 ₃; at D=1, R2 ₁-R2 ₄ are given the operation result; at D=2, R2 ₂-R2 ₅ are given the operation result; and at D=3, R2 ₃-R2 ₆ are given the operation result. The remaining bit positions of the merged data R2 are filled with “0”.

Writing mask data M is a function of merging address D and slicing bit width W, as shown in the table of FIG. 20. With the slicing bit width W being 0, writing mask data M has “1” at one bit position and “0” at remaining bit positions. With W=1, data M has “1” at two contiguous bit positions and “0” at remaining bit positions. With W=2, data M has “1” at three consegutive bit positions, and with W=3, data M has “1” at four consequtive bit positions and “0” at remaining bit positions. Bit positions of writing mask data having “1” are determined from the merging address D, i.e., with D=0, W+1 bits from M₀ becomes “1”; with D=1, W+1 bits from M₁ becomes “1”; with D=2, W+1 bits from M₂ become “1”; and with D=3, W+1 bits from M₃ become “1”, with remaining bit positions becoming “0”.

FIG. 21 shows the output function table for the operation result register 7. The 8-bit register 7 provides outputs as a function of merged data R2 and writing mask data M. With bit i of writing mask data M being “0”, i.e., Mi=0, bit i of stored data R3, i.e., R3 _(i), is unchanged, while with Mi being “1”, the R3 _(i) is overwritten by bit i of merged data R2, i.e., R2i, where i takes an arbitrary value ranging 0 through 7.

FIG. 22 illustrates the execution of raster operation on the bit-map display, in which a pair of image data in rectangular areas SA and SB are processed to obtain the result in a rectangular area DST. The bit-map display has a memory which is arranged in the 8-bit or 16-bit word length for reading and writing as in the usual memory. The rectangular data areas SA, SB and DST correspond to bit blocks of memory regardless of word boundaries. Slicing of a bit block within a word or beyond a word is treated by the bit operation processor which operates as shown in FIG. 23.

In this embodiment of the bit operation processor, the operation will be described with the following assumption of settings. The operand data I1 has a starting bit position of IS1=1, operating data I2 has IS2=3, and operation bit width W is 2 bits. The operand data I1 has value ‘100’ on bits 1-3, and operating data I2 has value ‘001’ on bits 3-5.

The operand data slicing circuit 3 responds to the values IS1=1 and W=2 to slice three bits (‘100’) from the operand data I1, and adds “0” following the lowest bit position to form sliced operand data I3. In the same way, the operating data slicing circuit 4 produces sliced operating data I4. The processing unit 5 performs operation between the sliced data I3 and I4, and provides the result R1. In the example of FIG. 23, the processing unit 5 is instructed to execute logical summation for the given data. The merging circuit 6 responds to the values of W and D to merge the high-order 3 bits (‘101’) of the operation result R1 into 3 bits of the stored data R3 starting at bit 3. By the above operations, operand data I1 and operating data I2 are sliced and, after operation between the data, the result is merged into the stored data R3.

Although logical summation has been explained in the above embodiment, other logical operations such as negation (NOT) and logical multiplication (AND) can obviously be executed. For arithmetic operations, when the bit width of operating data is smaller than the operation bit width (4 bits) of the processing unit 5 as in the case of FIG. 23, lower bit(s) are filled with “0”. Arithmetic operations between zeros results in zero without the occurrence of the carry or borrow and does not affect the operation result of high-order bits, and therefore arithmetic operations with less number of bits can be executed. For addition of carry, the operation decode data F becomes 1; and the occurrence of carry is propagated up to the effective bit position, at which the carry bit is added.

Although in the above embodiment the operation decode data F is used only for the sliced operand data IS3, other operation decode data may be used for the sliced operating data IS4 to carry out the execution identically.

As described above, the present invention is effective in controlling the bit position of data for bit operation, as follows.

(1) By addition of a bit address operation unit ADB to the conventional word address operation unit ADW, control of operation between data with different starting bit positions SN and DN in each word data can be simplified.

(2) By using the carry signal AC of the bit address operation unit ADB for incrementing the word address operation unit ADB and by providing registers SNR and DNR separately, the word data memory areas X_(A) and X_(B) can readily be accessed independently of the internal bit processing.

(3) By implementing bit address and word address control and memory access control on a hardware basis, the process can be simplified down to ⅓ or less in terms of processing steps as compared with the conventional system (see FIGS. 3 and 11), whereby speed-up of process is accomplished.

(4) Since execution of operational processes for any number of bits at any bit position in word data can be made, speed-up of bit operation is accomplished. 

What is claimed is:
 1. An image data processing system for processing image data, the image data processing system comprising: an image memory circuit having word boundaries for storing N bit image data at each of a plurality of word address positions, said N bit image data being aligned with word boundaries and said N being an arbitrary integer number; an image data operation processing circuit for processing M bit source image data and M bit destination image data, said M being an arbitrary integer number, and at least said M bit destination image data having a width which straddles at least a consecutive two of said word boundaries; a memory interface circuit for transferring word addresses corresponding to said word address positions of said M bit source image data and said M bit destination image data to said image memory circuit to read/write said N bit image data from/to said image memory circuit, and receiving said N bit image data including at least a part of said M bit source image data and said N bit image data including at least a part of said M bit destination image data to be operated on by said image data operation processing circuit from said image memory circuit; wherein said image data operation processing circuit receives each of said M bit source image data and said M bit destination image data from said memory interface circuit, and aligns a start bit position of said M bit source image data with a start bit position of said M bit destination image data, both of said start bit positions of M bit source image data and M bit destination image data are arbitrary positions in said N bit image data; wherein said image data operation processing circuit executes a logic operation on said M bit source image data aligned with said M bit destination image data, and said M bit destination image data, and transfers said operated on M bit image data to said memory interface circuit for writing said operated on M bit image data to said word address positions at which said N bit image data including at least a part of said M bit destination image data was read by said memory interface circuit; and a display unit for displaying said image data from said image memory circuit.
 2. The image data processing system according to claim 1, wherein said image data is pixel data.
 3. An image data processing system for processing image data, the image data processing system comprising: an image memory circuit having word boundaries for storing N bit image data at each of a plurality of word address positions, said N bit image data being aligned with word boundaries and said N being an arbitrary integer number; an image data operation processing circuit for processing M bit source image data and M bit destination image data, said M being an arbitrary integer number, and at least said M bit destination image data having a width which straddles at least a consecutive two of said word boundaries; a first register circuit for storing bit width information M of said M bit source image data and said M bit destination image data; a second register circuit for storing start bit positions of said M bit source image data and said M bit destination image data; a memory interface circuit coupled to said image memory circuit for generating word addresses corresponding to said word address positions of said M bit source image data and said M bit destination image data to read/write said N bit image data from/to said image memory circuit in accordance with the contents of said first register circuit and said second register circuit, transferring said word addresses to said image memory circuit, and receiving said N bit image data including at least a part of said M bit source image data and said N bit image data including at least a part of said M bit destination image data to be operated on by said image data operation processing circuit from said image memory circuit; wherein said image data operation processing circuit receives each of said M bit source image data and said M bit destination image data from said memory interface circuit, and aligns a start bit position of said M bit source image data with a start bit position of said M bit destination image data, both of said start bit positions of M bit source image data and M bit destination image data are arbitrary positions in said N bit image data; wherein said image data operation processing circuit executes a logic operation on said M bit source image data aligned with said M bit destination image data, and said M bit destination image data, and transfers said operated on M bit image data to said memory interface circuit for writing said operated on M bit image data to said word address positions at which said N bit image data including at least a part of said M bit destination image data was read by said memory interface circuit; and a display unit for displaying said image data from said image memory circuit.
 4. The image data processing system according to claim 3, wherein said image data is pixel data.
 5. An image data processing system for processing image data, the image data processing system comprising: an image memory circuit having word boundaries for storing N bit image data at each of a plurality of word address positions, said N bit image data being aligned with word boundaries and said N being an arbitrary integer number; an image data operation processing circuit for processing M bit source image data and M bit destination image data, said M being an arbitrary integer number, and at least said M bit destination image data having a width which straddles at least a consecutive two of said word boundaries; a memory interface circuit for transferring word addresses corresponding to said word address positions of said M bit source image data and said M bit destination image data to said image memory circuit to access said image memory circuit to read/write said N bit image data from/to said image memory circuit, and receiving said N bit image data including at least a part of said M bit source image data and said N bit image data including at least a part of said M bit destination image data to be operated on by said image data operation processing circuit from said image memory circuit; wherein said image data operation processing circuit receives each of said M bit source image data and said M bit destination image data from said memory interface circuit, and aligns a start bit position of said M bit source image data with a start bit position of said M bit destination image data, both of said start bit positions of M bit source image data and M bit destination image data, are arbitrary positions in said N bit image data; and wherein said image data operation processing circuit executes a logic operation on said M bit source image data aligned with said M bit destination image data, and said M bit destination image data, and transfers said operated on M bit image data to said memory interface circuit for writing said operated on M bit image data to said word address positions at which said N bit image data including at least a part of said M bit destination image data was read by said memory interface circuit; and wherein said memory interface circuit, when performing a read operation, accesses said image memory circuit a number of times equal to the number of word address positions of both of said M bit source image data and said M bit destination image data; and wherein said memory interface circuit when writing a result of said logic operation to said image memory circuit, accesses said image memory circuit a number of times equal to the number of word address positions of said M bit destination image data; and a display unit for displaying said image data from said image memory circuit.
 6. An image data processing system for processing image data, the image data processing system comprising: an image memory circuit having word boundaries for storing N bit image data at each of a plurality of word address positions, said N bit image data being aligned with word boundaries and said N being an arbitrary integer number; an image data operation processing circuit for processing M bit source image data and M bit destination image data, said M being an arbitrary integer number, and at least said M bit destination image data having a width which straddles at least a consecutive two of said word boundaries; a first register circuit for storing width information M of said M bit source image data and said M bit destination image data; a second register circuit for storing start bit positions of said M bit source image data and said M bit destination image data; a memory interface circuit coupled to said image memory circuit for generating word addresses corresponding to said word address positions of said M bit source image data and said M bit destination image data to read/write said N bit image data from/to said image memory circuit in accordance with the contents of said first register circuit and said second register circuit, transferring said word addresses to said image memory circuit, and receiving said N bit image data including at least a part of said M bit source image data and said N bit image data including at least a part of said M bit destination image data to be operated on by said image data operation processing circuit from said image memory circuit; wherein said image data operation processing circuit receives each of said M bit source image data and said M bit destination image data from said memory interface circuit, and aligns a start bit position of said M bit source image data with a start bit position of said M bit destination image data, both of said start bit positions of M bit source image data and M bit destination image data are arbitrary positions in said N bit image data; wherein said image data operation processing circuit executes a logic operation on said M bit source image data aligned with said M bit destination image data, and said M bit destination image data, and transfers said operated on M bit image data to said memory interface circuit for writing said operated on M bit image data to said word address positions at which said N bit image data including at least a part of said M bit destination image data was read by said memory interface circuit; wherein said memory interface circuit, when performing a read operation, accesses said image memory circuit a number of times equal to the number of word address positions of both of said M bit source image data and said M bit destination image data; and wherein said memory interface circuit when writing a result of said logic operation to said image memory circuit, accesses said image memory circuit a number of times equal to the number of word address positions of said M bit destination image data; and a display unit for displaying said image data from said image memory circuit. 